Automatically Adapting the Structure of Audio Similarity Spaces
نویسندگان
چکیده
Today, among the best-performing audio-based music similarity measures are algorithms based on Mel Frequency Cepstrum Coefficients (MFCCs). In these algorithms, each music track is modelled as a Gaussian Mixture Model (GMM) of MFCCs. The similarity between two tracks is computed by comparing their GMMs. One drawback of this approach is that the distance space obtained this way has some undesirable properties. In this paper, a number of approaches to correct these undesirable properties are investigated. They use knowledge about the properties of music by using other music tracks as a reference. These reference tracks can either be the music collection itself, or they may be an external set of reference tracks. Our results show that the proposed techniques clearly improve the quality of this audio similarity measure. Furthermore, preliminary experiments indicate that the techniques also help to improve other similarity measures. They may even be useful in completely different domains, most notably text information retrieval.
منابع مشابه
Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)
Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...
متن کاملDomain Selection and Adaptation in Smart Homes
Recently researchers have proposed activity recognition methods based on adapting activity knowledge obtained in previous spaces to a new space. Adapting activity knowledge allows us to quickly recognize activities in a new space using only small amounts of unlabeled data. However, adapting from dissimilar spaces not only does not help the recognition task, but might also lead to degraded recog...
متن کاملAdaptive Temporal Modeling of Audio Features in the Context of Music Structure Segmentation
This paper describes a method for automatically adapting the length of the temporal modeling applied to audio features in the context of music structure segmentation. By detecting regions of homogeneous acoustical content and abrupt changes in the audio feature sequence, we show that we can consequently adapt temporal modeling to capture both fastand slowvarying structural information in the au...
متن کاملSemiautomatic Benchmarking of Feature Vectors for Multimedia Retrieval
Modern Digital Library applications store and process massive amounts of information. Usually, this data is not limited to raw textual or numeric data – typical applications also deal with multimedia data such as images, audio, video, or 3D geometric models. For providing effective retrieval functionality, appropriate meta data descriptors that allow calculation of similarity scores between dat...
متن کاملAdapting Metrics for Music Similarity Using Comparative Ratings
Understanding how we relate and compare pieces of music has been a topic of great interest in musicology as well as for business applications, such as music recommender systems. The way music is compared seems to vary among both individuals and cultures. Adapting a generic model to user ratings is useful for personalisation and can help to better understand such differences. This paper presents...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006